Extracting Size and Shape Information of Sound Source in an Optimal Auditory Processing Model
نویسندگان
چکیده
We hear phonemes pronounced by men, women and children as approximately the same although the length of the vocal tract varies considerably from group to group. At the same time, we can identify the speaker group. This suggests that we extract and separate the size and shape information of sound sources. The impulse response of the vocal tract is compressed or expanded in time when the length of the vocal tract is compressed or expanded proportionally with the same cross-area function. The compressed and dilated versions of the impulse response can be converted into the same distribution using the Mellin transform. In this paper we show that the Mellin transform can be applied to the stabilised wavelet transform that forms the basis of the Auditory Image Model (AIM) of processing in the auditory pathway. The combined processing normalizes source size information and produces a new, fruitful representation of source shape information, referred to as the “Mellin Image.” This “Stabilised Wavelet-Mellin Transform” (SWMT) also provides the mathematical framework for the derivation of the gammachirp auditory filterbank (Irino and Patterson, 1997).
منابع مشابه
Stabilised wavelet mellin transform: an auditory strategy for normalising sound-source size
We hear phonemes pronounced by men, women and children as approximately the same although the length of the vocal tract varies considerably from group to group. At the same time, we can identify the speaker group. This suggests that we extract and separate the size and shape information of sound sources. The impulse response of the vocal tract is compressed or expanded in time when the length o...
متن کاملSelective deficits in human audition: evidence from lesion studies
The human auditory cortex is the gateway to the most powerful and complex communication systems and yet relatively little is known about its functional organization as compared to the visual system. Several lines of evidence, predominantly from recent studies, indicate that sound recognition and sound localization are processed in two at least partially independent networks. Evidence from human...
متن کاملSound resynthesis from Auditory Mellin Image using STRAIGHT
We propose an Auditory VOCODER to resynthesize sound from the Auditory Mellin Image which is an auditory representation that segregates the size and shape information of incoming sound. The sound resynthesis part consists of three techniques: the STRAIGHT VOCODER [2], frequency-warping cepstral analysis [4,12], and nonlinear multivariate regression analysis (MRA). We explain these methods and t...
متن کاملSelective deficits in human audition: evidence from lesion studies
The human auditory cortex is the gateway to the most powerful and complex communication systems and yet relatively little is known about its functional organization as compared to the visual system. Several lines of evidence, predominantly from recent studies, indicate that sound recognition and sound localization are processed in two at least partially independent networks. Evidence from human...
متن کاملCalculation of the drop in sound pressure level and frequency analysis of aerospace engine test cell (Research Article)
Aerospace engines testing is a source of noise pollution and determining the low frequency acoustic characteristics of the test cell, plays an important role in optimally control of the sound field and reducing the level of sound pressure and pollution. In this study, the drop in average sound pressure level is numerically predicted by constructing a test cell according to ISO 140 standard. To ...
متن کامل